Common patterns in word level prosody
Abstract
The task of generating natural human-sounding prosody for text-to-speech (TTS) has historically been one of the most challenging problems that researchers and developers have had to face. TTS systems have in general become infamous for their “robotic” intonations. This paper describes an approach to this problem which endeavors to capture as much detail as possible from speech data, but in a way that avoids the “black boxes” typical of neural networks and some vector clustering algorithms. Unlike these latter methods, our approach may give feedback as to exactly what the crucial parameters are that determine the successful choice of pattern. Focusing on the notion of prosody templates, we confirmed that a representative F0 and duration pattern can be extracted based on stress pattern for a target proper noun which occurs in sentence-initial position.
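As a rough illustration of the prosody-template idea described above, the following sketch groups word tokens by stress pattern, time-normalizes each F0 contour to a fixed number of points, and averages within each group to obtain a representative F0 and duration pattern. It is not the authors' procedure: the stress-pattern encoding (strings such as "10"), the function names `resample` and `build_templates`, the linear interpolation, and the plain averaging are all assumptions made for the example.

```python
from collections import defaultdict
from statistics import mean


def resample(contour, n_points):
    """Linearly interpolate an F0 contour onto n_points evenly spaced positions."""
    if n_points < 2:
        raise ValueError("n_points must be at least 2")
    if len(contour) == 1:
        return [float(contour[0])] * n_points
    out = []
    for i in range(n_points):
        # Fractional index into the original contour.
        pos = i * (len(contour) - 1) / (n_points - 1)
        lo = int(pos)
        hi = min(lo + 1, len(contour) - 1)
        frac = pos - lo
        out.append(contour[lo] * (1 - frac) + contour[hi] * frac)
    return out


def build_templates(tokens, n_points=5):
    """Build one averaged template per stress pattern.

    tokens: iterable of (stress_pattern, f0_values_hz, duration_s) tuples.
    The stress_pattern key (e.g. "10" for stressed-unstressed) is a
    hypothetical encoding chosen for this sketch.
    """
    groups = defaultdict(list)
    for stress, f0, duration in tokens:
        groups[stress].append((resample(f0, n_points), duration))

    templates = {}
    for stress, items in groups.items():
        contours = [contour for contour, _ in items]
        templates[stress] = {
            # Point-wise mean across all time-normalized contours.
            "f0": [mean(column) for column in zip(*contours)],
            "duration": mean(duration for _, duration in items),
            "count": len(items),
        }
    return templates
```

Because each template is just a set of per-group averages keyed by an explicit feature, it is easy to inspect which feature values produce which pattern, which is the kind of transparency the abstract contrasts with black-box models.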
Similar resources
Semantic Prosody: Its Knowledge and Appropriate Selection of Equivalents
In translation, choosing an appropriate equivalent is essential to convey the right message from source text to target text, and one issue that may play a decisive role in that choice is the semantic prosody (SP) behavior of words and the relation between the SP of a word and the semantic senses (i.e. negativity, positivity or neutrality) of its collocations in ...
The prosody of contrastive topics in Southern Swedish
This paper presents a pilot study on the prosodic marking of a contrastive topic in Southern Swedish. A test sentence was elicited in three experimental conditions: initial focus; final focus; contrastive topic (initial word) plus focus (final word). F0 patterns were analysed in recordings of 10 speakers. A majority of the speakers distinguished clearly between the conditions, but speakers empl...
Expectations from preceding prosody influence segmentation in online sentence processing
Previous work examining prosodic cues in online spoken-word recognition has focused primarily on local cues to word identity. However, recent studies have suggested that utterance-level prosodic patterns can also influence the interpretation of subsequent sequences of lexically ambiguous syllables (Dilley & McAuley, 2008; Dilley, Mattys, & Vinke, 2010). To test the hypothesis that these distal ...
Stress, duration, and intonation in Arabic word-level prosody
This paper presents the results of a study of the expression of word-level prosody in Jordanian Arabic. The study focuses on the durational, spectral, and fundamental frequency correlates of stress and word-final juncture in the speech of four speakers. Speakers exhibit extensive final lengthening effects and a smaller effect of stress and penultimate lengthening. Stress lengthening correlates with...
A Systematic Review of Hindi Prosody
Prosody describes both form and function of a sentence using the suprasegmental features of speech. Prosody phenomena are explored in the domain of higher phonological constituents such as word, phonological phrase and intonational phrase. The study of prosody at the word level is called word prosody and above word level is called sentence prosody. Word Prosody describes stress pattern by compa...